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Acronyms 


¢ Combinatorial logic (CL) ¢ Probability of configuration upsets 

* Commercial off the shelf (COTS) (P configuration) 

* Complementary metal-oxide ¢ Probability of Functional Logic upsets 
semiconductor (CMOS) (PiunctionalLogic) 

* Device under test (DUT) ¢ Probability of single event functional interrupt 

¢ Edge-triggered flip-flops (DFFs) (Pseri) 


* Probability of system failure (P.,<tem) 

e Processor (PC) 

¢ Radiation Effects and Analysis Group (REAG) 

¢ Reliability over time (R(t)) 

¢ Reliability over fluence (R(®)) 

¢ Single event effect (SEE) 

¢ Single event functional interrupt (SEFI) 

¢ Single event latch-up (SEL) 

¢ Single event transient (SET) 

e Single event upset (SEU) 

¢ Single event upset cross-section (O<-y) 

e System on a chip (SoC) 

¢ Windowed Shift Register (WSR) 

¢ Xilinx Virtex 5 field programmable gate array 
(V5) 

¢ Xilinx Virtex 5 field programmable gate array 
radiation hardened (V5QV) 
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¢ Electronic design automation (EDA) 
¢ Error rate (A) 

¢ Error rate per bit(A,;,) 

- Error rate per system(A,, om) 

¢ Field programmable gate array (FPGA) 

¢ Global triple modular redundancy (GTMR) 
¢ Hardware description language (HDL) 

e Input — output (I/O) 

¢ Intellectual Property (IP) 

e Linear energy transfer (LET) 

e Mean fluence to failure (MFTF) 

¢ Mean time to failure (MTTF) 

e Number of used bits (#Usedbits) 

¢ Operational frequency (fs) 

e Personal Computer (PC) 


Problem Statement 


Conventional methods of 
applying single event upset 
(SEU) data to complex systems 
need improvement. 


The problem boils down to 
extrapolation and application of 
SEU data to characterize system 
performance in radiation 
environments. 
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Abstract — Impact to spacarnlioniel S 


¢ We are investigating the application of « 
performance metrics combined with jtandaid : 


e We expect to relate SEU behavior to system performance 
requirements... 
— Our proposed methodology will provide better prediction of SEU 


responses in harsh radiation environments with confidence 
metrics. 
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e When a system is 
targeted for space, single 
event effect (SEE) data 
are obtained for all 
devices that make up that 
system. 


¢ Combining component 
data is not simple 
addition. 


« Co-dependent 
susceptibilities exist and 
must be handled 
accordingly. 
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Scope of Presentation S 


¢ Full system analysis requires combining SEU 
data for a variety of devices across a variety of 
boxes/mediums. 


¢ The scope of this presentation is for System-type 
SEU data analysis for a single device. 

¢ In this presentation, a System on a Chip (SoC) 
field programmable gate array (FPGA) device Is 
used as an example. 


e Future work will expand to address full systems. 


This presentation is a simplified approach for SEU data 
extrapolation to complex systems. Future work will 
incorporate details for a more realistic analysis. 
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Background (1) Ss 
FPGA SEU Susceptibility 


SEU Cross Section (05,-,) 
° Ocpys (per category) are calculated from SEU test and analysis. 


° Ocpys are calculated with particles that vary in linear energy 
transfer (LET). 

e FPGA architectures vary and so do their SEU responses. 

¢ Most believe the dominant o,,,s are per bit (configuration or flip- 
flops (DFFs)). However, global routes are significant (more than 
DFFs). 


Ospys are measured Ospys are measured 
xP bit by bit??? 
P (fs ) sein L Conf iguration +P (fs ) functionalLogic + SEI 
Design OseU Configuration Osceu Functional logic SEFI Oseu 


OsEU 


Sere Global Routes 
For a system, should o,,,s be Combinatorial and Hidden 


: logic (CL) in data 
measured by bit???? ars _ Logic 
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Conventional See craien. a el a Sections To 
Error Rates for Complex Systems First Step 


LET: Linear energy transfer 


e Perform SEU accelerated radiation testing across ions with different 
linear energy transfers (LETS) to calculate o,-ys per LET. 


Heavy lon ("Cosmic Ray") 


n-Substrate 


Epitaxial Layer (Lightly Doped)- F 


Substrate (Heavily Doped) = 
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Windowed Shift Register (WSR) Test 
Structures Are Used To Obtain SEU Data 


CL: combinatorial logic 


N levels of Inverters > > >. DFF = D flip flop 
between DFF stages: : 4-bit Window Output 
N = 0, 8, and 18 
Shift Register Chain 
SS —$—<$< i —___—______»- 
a a= a = a 
= ae a I me 


e Shift registers are typical test structures (mapped 
into device under test (DUT)) used for accelerated 
radiation testing. 


¢ Purpose is to analyze DFF and CL susceptibility. 
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Windowed Shift Register (WSR) Microsemi - Ss 
RTG4 Heavy lon Data at 10O0OMHz 


oc. Add combinatorial che Increase frequency may 
.00E-09 » increase cross section. or may not change SEU 


——_——? * gata. 


How and what you x 
test make a big 
5.00E-09 | difference! 


6.00E-09 
@WSR16 Checkerboard 


m@WSR8 Checkerboard 
4 WSR4 Checkerboard 


i XWSRO Checkerboard 
Q 4.00E-09 
< XWSRY1E All 1's 
S ase @®WSRS8 All 1's 
a =WSR4 All 1's 
6 -WSRO All 1's 
2.00E-09 —-WSR16 All 0's 
#WSR8 All 0's 
1.00E-09 BE WSR4 All 0's 
AWSRO All 0's 
0.00E+00 


0 5 10 15 20 25 
LET MeV*cm2/mg 
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What If Tests Do Not Investigate Test Ss 
Structures Across A Variety of Parameters 


Data might not reflect potential SEU responses! 


7.00E-09 - 
. 
6.00E-09 - = 
Which SEU cross sections raid i healmominbaletiiben 
5.00E-09 - be used?? Don’t want to niles 
_~ : 4 WSR4 Checkerboard 
nm overestimate and don’t want to 
G 4.00E-09 - underestimate. ee eee 
= XWSR16 All 1's 
S, sancti n @WSR8 All 1's 
y 5 =WSR4 All 1's 
© -WSRO All 1's 
200E 02 x —WSR16 All 0's 
#WSRS3 All 0's 
1.00E-09 - * e =WSR4 All 0's 
* i AWSRO All 0's 
0.00E+00 -—— L | a 
0 5 10 15 20 25 


LET MeV*cm2/m 
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Background (3) Sy 
Conventional Conversion of SEU Cross-Sections To 
Error Rates for Complex Systems Next Step 


Bottom-Up approach (transistor level): 

— Given occy (per bit) use an error rate calculator (such as CREME96) 
to obtain an error rate per bit (,,, )- 

— Multiply 2,,, by the number of used memory bits (#UsedBits) in the 
target design to attain a system error rate (),,.,..,)- Configuration and 
DFFs. 

Top-Down approach (system level): 
* Given Ocey (per system) use an error rate calculator (Such as 
CREME96) to obtain an error rate per bit (1,, 16m )- 


Heavy lon ("Cosmic Ray") 
0 


4 - +. 
Epitaxial Layer (Lightly Doped)- 7 ff 7 - 


Substrate (Heavily Doped) - +a + 
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Understand Goal of SEU Testing and Ss 
Data Application 


e Is the goal of SEU testing to analyze test circuits? 
— Efficacy of DFF mitigation. 
— Single event transient (SET) propagation strength. 
— SET width. 
— General test circuit evaluation. 
e Or... is the goal of testing to obtain data for eventual system 
characterization? 
e System characterization requires more than conventional test 
circuit analysis. 
— Test circuits are too simple. 


— Test circuits often do not follow formal design rules (e.g., 
synchronous, CMOS balancing, or place and route). 


— Design topology affects SEU response. 
e Complex system test structures are important for SEU system 
characterization. 
— Top down approach. 
— Multiple complex test structures and trend evaluation is essential. 


To be presented by Melanie Berg at the RADECS 2017 Radiation Effects on Components and Systems (RADECS) Conference, Geneva, Switzerland, October 6, 2017. 13 


Technical Problems with Current 
Methods of Error Rate Calculation 


¢ For submission to CREME96, Oscy 
data (in Log-linear form) are fitted to 
a Weibull curve. 


— The two main parameters for curve 
fitting are a shape factor andaslope = 1.00E-02 - 


1.00E-01 ; 


sy 

raetOr: — D 1.00E-03- 7 

— During the curve fitting process,a % 
large amount of error can be = 1.00E-04 | Data mimic wear- 
introduced. 7 . 5 1.00E-05 out portion of 

— Consequently, it is possible for ca - Weibull curve 
resultant error rates (for the same o 1.00E-06 - 
design) to vary by decades. 1.00E-07 

e Because of the error rate calculation 
1.00E-08 + 


process, O.<-y data are blended | 
together and it is nearly impossible 
to hone in on the problem spots. 
This can become important for 
mitigation insertion. 
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0.0 20.0 40.0 60.0 
LET MeV*cm2/mg 


Technical Problems with Bottom-Up 
Analysis Method (1) 


e Multiplying each bit within a design by 4,,, is 
not an efficient method of system error rate 
prediction. 


— Works well with memory structures... 
but...complex systems do not operate or 
respond like memories. 


— If an SEU affects a bit, and the bit is either 
inactive, disabled, or masked, a system 
malfunction might not occur. < Ay,.x#UsedBits 


A 
¢ Using the same multiplication factor eee 
across DFFs will produce extreme over- 
estimates. 


¢ To this date, there is no accurate method to predict system state 
activity for complex systems. 


¢ Fault injection or simulation will not determine frequency of 
activity. Any electronic design automation (EDA) tool supplier 
will agree. 
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Technical Problems with Bottom-Up Ss 
Analysis Method (2) 


e There are a variety of components that 
are susceptible to SEUs (clocks, resets, 
combinatorial logic, flip-flops (DFFs, 7 
etc...)). 

— Various component susceptibilities are not 
accurately characterized at a per bit level. 

— Design topology makes a significant 
difference in susceptibility and is not 
characterized in error rate calculators (e.g., 
CREME96). 

— Asaresult, there’s a lot of hand-waving 
during data extrapolation. 


Error rates calculated at the transistor-bit level are 
estimated at too small of granularity for proper 
extrapolation to complex systems. 
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Do 
ox 


¢ Classical reliability 
models have been used 
as a standard metric for 
complex system 
performance. 


¢ The analysis provides a 
more in depth 
interpretation of system 
behavior over time by 
using system-level MTTF 
data for system 
performance metrics. 


R@=e"™"' or R(t)=e" 
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Weibull Failure Rate (i(T)) Bathtub @y 
Curve 


oO 

E 

- |* j 

_ . Early Life i 

o. ‘ (failure rate decreases wi time) , 

W) : \Wearout Life —————+»/ 

S : (failure rate increases wy time) F 

— ° i 

£ : r 

2 7 f 

g / 

@ . Useful Life 2 

_ ra (failure rate approx. —, - 

= . r 

LL ‘. _* 

Fd 
Time : 

We will focus on the 
“Useful Life” of the 

Independent events bathtub curve for this 
analysis. 
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Mapping Classical Reliability Models from Sy 
The Time Domain To The Fluence Domain 


The exponential model that relates reliability to MTTF 
assumes that during useful-lifetime: 


— Failures are independent. R(=e™"F or R(Q=e” 
— Error rate is constant. Weibull slope <2 exponential. 
— MTTF = 1/1. , 

For a given LET (across fluence): Parallel between 
— SEUs are independent. time and fluence. 
— Ocpy IS constant. Oscy = #errors/fluence 
— MFTF = Io... A system = #errors/time 


Hence, mapping from the time domain to the fluence 
domain (per LET) is straight forward: 

—-to@® 

— MTTF © MFTF 


~h © copy R()=e*MTTF  &R(@) =e OMFIF 
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Creating Reliability Curves from o.-yS oS 


1.00E-02 

° Osey data are system level. € 1.00603 py “ 

¢ A histogram of environment o i ooeua 
data is created. Bins are = poe 0s : 
determined by LET values at 5 piete : 
each O,-y data point. re 

¢ For each data point at a given b enseaae | | 
LET, a combination of binned 0.0 20.0 40.0 60.0 
environment data and upper- pet ee 
bound o,-y data are used to 
determine system reliability 
performance. 


e A piecemeal approach is 
performed per data point to 
determine the weakest points ” §t00 Mile Al Shielding 
of system performance. 


Flux (#/cm?/day) > LET 


10° 10! 10° 
LET (MeV*cm2/mg) 
M. A. Xapsos, IEEE NSREC Short Course, Ponte Vedra 
Beach, FL, 2008. 
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Example of Proposed Methodology Ss 


a Application 
e Mission requirements: 


— The FPGA shall contain an embedded microprocessor. 

— Selection shall be made between a Xilinx V5QV (very 
expensive device) or a Xilinx V5 with embedded PowerPC 
(relatively cheap device). 

— FPGA operation shall have reliability of 3-nines (99.9%) 
within a 10 minute window at Geosynchronous Equatorial 
Orbit (GEO). 

¢ Proposed methodology: 

— Create a histogram of particle flux versus LET for a 10- 
minute window of time for your target environment. 

— Calculate MFTF per LET (obtain SEU data). 

— Graph R(®) for a variety of LET values and their associated 
MFTEs. R(®)=eOMFIF 


— For selected ranges of LETs, use an upper bound of particle 
flux (Number of particles/cm2*10-minutes), to determine if 
the system will meet the mission’s reliability requirements. 
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Environment Data: Flux versus LET 
Histogram for A 10-minute Window 
Geosynchronous Equatorial Orbit (GEO) 100-mils shielding 


L.0E+03 Bins are selected based on o,-,, data 


points. 


” 
@ 1.0E+02 
—_ 


= 1.0E+01 
= 1.0E+00 
1.0E-01 
1.0E-02 
1.0E-03 
1.0E-04 
1.0E-05 
1.0E-06 


Flux(particles/(cm2*10 


1.0E-07 
1.0E-08 


0T00.07 0.07 T0 0.14 0.14T01.8 1.8T03.6 3.6To20 20 To 40 40 and over 
LET Bins (MeVcm2/mg) 
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MFTFE versus LET for the Xilinx V5 Ss 
Embedded PowerPC Core and the Xilinx 
V5QV MicroBlaze Soft Processor Core 


1.00E+08 - 


° V5QV: no system errors MFTF = 1ocpy 
were observed below veou Meee 
LET=1.8MeVecm2/mg. enti plein iach 

with Cache Enabled 
Total fluence > 5.0x108 *% 
particles/cm2. * mV5: PowerPC 


1.00E+06- ss @ 
¢ PowerPC: 


— No system errors were 
observed below 
LET=0.07MeVecm2/mg 
with total fluence = 
1.0x10° particles/cm72. 


— Hence, at 0.07, we will jeoneies a ‘ 
assume an upper-bound ~ 
MFTF = 1.0x108 - 
particles/cm2. 1.00E+02 — : : 7 | 
— More tests would increase : - a io = ~~ 
the MFTF for this bin. sen Mevemmng 
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1.00E+05 - 


1.00E+04 | 


MFTF (particles/cm?2) 


Reliability across Fluence up to Ss 


LET=0.07MeVecm2/mg 


Binned GEO Environment data show approximately 3000 
particles/(cm2e10-minutes), in the range of 0.0MeVecm2/mg to 
0.07MeVecm2/mg. We are using MFTF for 0.07MeVecm2/mg to upper 


bound this bin. 


1.000000E+00 - 
9.999800E-01 - 
9.999600E-01 - 


Reliability 


9.998800E-01 - 
9.998600E-01 - 


9.998400E-01 


9.999400E-01 - 
9.999200E-01 - 
9.999000E-01 - 


0 


aaa PowerPC: MFTF = 1.0x108 


R(@)=e-%/1.0%10° 


Used MFTF= 1.0x108 because that was the 
maximum fluence for tests (no errors observed) 


TTTTITITITITITIIITITITIITItItiiririririrtrirrirrititirtitriitiriittrrrititriririiriritrririiritriitrtiirtitriiiitili 


1000 2000 3000 4000 5000 6000 7000 8000 9000 
Fluence (particles/cm?) 


Reliability at 3000 particles/(cm2*10-minutes) > 99.99% for the PowerPC 
design implementation. “9’s” could be increased with more tests. 
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Reliability across Fluence up to Ss 
LET=0.14MeVecm2/mg 


Binned GEO Environment data show approximately 11 
particles/(cm2e10-minutes), in the range of 0.07MeVecm2/mg to 
0.14MeVecm2/mg. We are using MFTF for 0.1MeVecm/2/mg to upper 
boundcbthia bu. 


9.999990E-01 _——™, ——=—PowerPC-V -5-0x105 
9.999980E-01 Oo 
en 


9.999950E-01 


Reliabilit 


9.999940E-01 D (db) — p- 8/5.0*10 


9.999930E-01 


9.999920E-01 
2.5 5 7.5 10 125 15 17.5 20 22.5 


oO 


Fluence (particles/cm?2) 


Reliability at 5 particles/(cm2*10-minutes) > 99.999% for the V5QV 
PowerPC design implementation. 
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Reliability across Fluence up to LET=1.8 Ss 
MeVecm2/mg 
Binned GEO Environment data show approximately 9 
particles/(cm2e10-minutes), in the range of 0.14MeVecm2/mg to 
1.8MeVecm2/mg. We are using MFTF for 1.8MeVecm2/mg to upper 
bound thPS°RFr°° 


9.999000E-01 arPC: M = 6.0x104 


/ 


9.998000E-01 


9.997000E-01 


: NS 
9.996000E-01 ee 
© 9.995000E-01 i 
9.994000E-01 ___We fall below 99.99% 
at approximately R(@)=e-0/6.0%1 04 


9.993000E-01 5particles/cm7! 


9.992000E-01 
0 4 8 12 16 20 24 28 


Fluence (particles/cm2) 
Reliability at 9 particles/(cm2*10-minutes) > 99.9% for the PowerPC 
design implementation. This is the most susceptible bin for the system. 
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Reliability across Fluence up to Ss 
LET=3.6MeVecm2/mg 


Binned GEO Environment data show approximately 0.23 
particles/(cm2e10-minutes), in the range of 1.8MeVecm2/mg to 
3.6MeVecm2/mg. 


1.00000E+00 : 

——. == V5QV: MFTF= 2.5x10° 
9.99950E-01 PowerPC: MFTF = 1.2x102 
9.99900E-01 


: ph 
‘2 9.99850E-01 
@ 
© 9.99800E-01 P 
R(@)=e- 92-510 
9.99750E-01 R(@) =e 9/1.2*10" 
9.99700E-01 
0 1 2 3 4 5 6 7 8 9 10 


Fluence (particle/cm2) 


Within this LET range, reliability at 0.23 particles/(cm2*10-minutes) 
> 99.999% for both design implementations. 
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Reliability across Fluence at Ss 
LET=40MeVcm2/mg 


Binned GEO environment data show approximately 0.07 
particles/(cm2e10-minutes), in the range of 3.6MeVecm2/mg to 


40.0MeVecmz2iitre 
feet Naat ais see ee 


0.9997 


Reliability 


0.9996 +e fall below 99.99% 
at approximatel 
0.02particles/cm?! R (D) =e 9/2-8x10" 


0.9995 R (>) =e-©/2.0* 104 


0.9994 
0 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1 


Fluence (particle/cm2) 
Within this LET range, reliability at 0.07 particles/(cm2e10-minutes) > 
99.9% for both design implementations. We can refine by analyzing 
smaller bins. 
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Example Conclusion ‘~ 


Using the proposed methodology, the commercial Xilinx 
V5 device will meet project requirements. 


In this case, the project is able to save money by 


selecting the significantly cheaper FPGA device and gain 
performance because of the embedded PowerPC. 
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Conclusions Nasal 


e This study transforms proven classical reliability models into the 
SEU particle fluence domain. The intent is to better characterize SEU 
responses for complex systems. 

e The method for reliability-model application is as follows: 

— SEU data are obtained as MFTF. 

— Reliability curves (in the fluence domain) are calculated using 
MFTF; and are analyzed with a piecemeal approach. 

— Environment data are then used to determine particle flux 
exposure within required windows of mission operation. 

e The proposed method does not rely on data-fitting and hence 
removes a significant source of error. 


¢ The proposed method provides information for highly SEU- 
susceptible scenarios; hence enables a better choice of mitigation 
strategy. 


e This is preliminary work. There is more to come. 


This methodology expresses SEU behavior and response in terms that 
missions understand via classical reliability metrics. 
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